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ABSTRACT 

The results of an algorithm, which is designed to take" 
a set of commonality coe/.f ic^ients, either real or manipulated, and, 
if possible, produce one- or more sets of regressor correlations that 
are consistent with, them are ;examined« A number of different ways of 
resolving the higJier order co.m^nonality values into their lower orders 
were tried and the' number ^and nature of solutions generated from them 
were examined for their meaniHgf ulness and variability* In general it 
was found that this. could be a' meaningful exercise because it allows 
an analyst to test his assumptions about the nature of the 
confounding to see if absolution can be obtained. If one cannot be 
obtained then his assumptions must be revised. But if one can be 
obtained, then he can examine them to see^ which kinds of viriablf-s 
have the greatest sensitivity to the assumptions* The resultinxr 
output also allows one t^ gauge the variability of regression f 
coefficients that will satisfy the same set of commonality Values and 
the effectiveness of the regression system. This technique called 
"Reverse Commonality" is best suited for an interactive computing 
arrangement so that an analyst can use it rapidly in a sequential 
manner* Since, for a laifge; number of variables the sheer volume of 
commonality values become unmanageable and, since a number of 
difjEerent ways are available for resolving the confounding, an 
algorithm is needed which uniquely resolves the higher order values 
according to one's assumptions* (Author/DEP) 
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The .results of a "Commonality Analysis" yield measures of the 
extent to, which each variable contributes to the explanation of 
variance in the dependent variable independently of the other ^ 
variables in the anajtysis. Such an analysis also yields measures 
of the extent to which two or more of the variables share some of 
their explanatory power. This shared explanatory power* may arise 

from the functional interplay of the variables over time and/or 

/ 

merely from their joint occurrence. For example, one might observe 
that children from more affluent families tend to attend schools 
that are better staffed and tend to do better on standardized 
achievement tests than their less affluent counterparts. Such 
observations would produce a degree of correlation among these 
three classes of variables so that the effects of affluence and 
school type on achievement would be difficult to disentangle. 
However, for heuristic reasons an analyst might want to^ attempt to 
disentangle their "possible effects". This paper includes .one. such 
technique and some illustrative applications. 

We have called this technique "Reverse Commonality" because it 
takes as given a set of commonality coefficients, from them reproduces 
all possible squared multiple correlatipns (RSQ's) and then attempts to 
produce ont or more sets of regressor (independent) variable inter- 
correlations tlfl? are consistent with these RSQ's. When the commonality 



coefficients have been obtained from real data, one admissible set of 

regressor intercorrelations will be obtained which corresponds, 

within rounding error, to the actual or real data intercorrelations.. 

When manipulated coefficients are read in to the program^^ the computer 

will inform the analyst as to the number of admissible solutions. If 

any exist the algorithm will print out the regressor intc^rcorrelations, their 

standardized regression weights and the variance explained by the 

regression. 

Before proceeding with the details of the technique we shall 
develop the discussion around a three variable commonality analysis. 
Later, greater numbers of variables will be dealt with. 

Beaton (1973) has synthesized the work of other investigators on 
commonality and has extended and clarified its properties. Consequently, 
we shall deal only briefly here with its nature for the three variable 
cas.e. Given three regressor variables Xi, X2 and X3, their relationship 
with a dependent variable Y can be expressed in terms of all of their 
possible squared simple and multiple correlations with Y as follows: 
RSQ (XI); RSQ (X2); RSQ (X3); RSQ (X1X2); RSQ (X1X3); RSQ (X2X3); 
RSQ (X1X2X3). For m variables there are 2^-1 correlations that can 
be obtained. Commonality analysis merely forms sums or differences of 
these RSQ*s so that the Y variance explained by any Xi can be expressed 
In terms of the proportion it has in common with the other X's plus what 
it brings uniquely to the analysis.* These latter unique terms can be 
obtained as follows: 

■ 4 • ' 

*Be«ton (1973) has a matrix formulation of this computational rationale 
for the general c^se* 



■ (1) U(Xi) = RSQ (Xj^X2X3) - RSQ (X2X3) 

(2) U(X2) = RSQ (X1X2X3) - RSQ (X1X3) 

(3) U(X3) - RSQ (Xj^X2X3) - RSQ- (XfX2) 

The second order commonality coefficients are obtained by: 

(4) C(XiX2) » RSQ (X1X2X3) - RSQ (X3) - U(Xi) - U(X2) 

(5) C(XiX3) = RSQ (X1X2X3)'- RSQ (X2) - U(Xi) - U(X3) 

(6) C(X2X3) = RSQ (X1.X2X3) - RSQ (X^) - U(X2) - VQ^^) 
^rtiile ,the third order coefficient is liobtained by: 

(7) C(XiX2X3) = RSQ (X1X2X3) - C(XiX2) - C(XiX3) - CQi^X^) 
- U(X^) - U(X2) - U(X3) 

By virtue of this foregoing, the RSQ's for Xi, X2 and X^ 
can each be expressed as a sum of their commonality coefficients 
follows: 

(8) RSQ(Xi) = C(XiX2X3) + C(XiX2) + C(XiX3) + U(Xi) 

(9) RSQ(X2) = C(Xj^X2X3) + CiX^X^) + CQL^X^) + ViX^) 

(10) RSQ(X3) = C(XiX2X3) + C(XiX3) + C(X2X3) + U(X3) 
and the total RSQ is given by: 

(11) RSQ (X1X2X3) = C(XiX2X3) + C(XiX2) + C(X^X^) + C(^K^) 

+ u(x^) + ucx^)' + ucxg) 



i 
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Arranging these in tabular form we have 



C(XiX3) 

cCXjXj) 

C(XiX2X3) 



ia 



_2. 
b 



f 
g 



_3. 
c 



e 

f 



where the al-phabetic entries represent empirically observed values and the 
values in each colum.n sum to ttie RSQ for the variable number heading the 
column. For example, RSQ (X^) = a + d + e + g (note that the empirically' 

observed value, for a coefficient is repeated under its respective 

\ ■ 

column as appropriate). These empirically observed values can be 
manipulated in a jiumber of different ways and then submitted to the 
algorithm developed by Beaton and described in Appendix A, to see if 
they yield a consistent set of correlations. The remainder of this 
paper deals with the results of the systematic application of this. 
Reverse Commonality procedure. 

1. Three Variable Reverse Commonality for a Fixed Level of Explained 
Variation 

The results of a three variable commonality analysis are given in 
Table 1*1* The variables used here are composites taken from Mayeske, 
et. al. (1974) • The first variable is a weighted* sum of a student's 
values on his Socio-Economic Status and F^ily Structure indices as 
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* llie weights are taken from the first principal Component of the 
variables under consideration* 
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well as a value for his Ethnic Group* and Boy-girl group** membership* 
The second variable is a weighted sum of his scores on four attitudinal 
and motivational indices*** while the third variable is the average 
achievement of studenta in the grade level that one attends school with. 
These three variables are called respectively: Social Background (SB); 
MotivationrJ (MTVTIO ; and School (SCH) factors. The dependent variable, 



Y, is each student's score on an Achievement composite • 



t 



* Whites were scored highest, Oriental-Americans intermediate, while all 
others were scored low. 

**Glrls were" scored highest. x 

***Ihese are: Expectations for Excellence; Attitude Toward Life; Educational 
Plans and Desires; and, Study Habits* For details see Mayeske, et« al., 
1972. 

tsee Mayeske, et. al., 1972 for more details on the nature of this 
composite. 
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Table 1.1. - Coinmonality Analysis of Achievement With Social Background, 
Motivational and School Factors* 





1 


2 


3, 




SB 


MiviM 




U(Xi) 


.0236 


.0307 


.1149 


C(XiX2) 


*0654 


.0654 




C(XiX3) 


.1025 




. ,i025 


C(X2X3) 




-.00.71 


-.0071 


C (X^X^X^ ) 


.11-17 


. .1117 


.1117 


RSQ(Xi) ' 


.3032 


.2008 


.3220 


RSQ(X]^X2X3) 




.4417 





/ 



* There are 123,305 students and their 2370 schools included in 
these analyses. 

Given this table of coinmonality coefficients we may ask: '*How 
do the regressor intercorrelations produced from these coefficients 
compare with those actually observed"? The observed (0) values are 
given in the first row of Table 1.2 with their reproduced values (R) 
directly below them. \ 
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Table 1.2. - Observed (0) and ' Reproduced (R) Regressor Intercorrelations 

for SB, MTVTN and SCH and their Correlations With Achievement 



Social Background (SB) 0 



Motivation (MTVTN) 



School Factors (SCH) 



K 
0 
R 
0 
R 



SB 
1 



MTVTN 


SCH 


ACHIEVEMENT 


.5879 


.5219 


.5506 


.5879 


.5219 


■ .5506 


1 


.2623 


.4481 




.2623 


.4480 




1 


.5674 




1 


.5675 



Inspection of the values ihova them to be identical to the third 
decimal^ digit in every case and to the fourth decimal digit in almost every 
case with the errors occurring in the regressor-regressand correlations 
(i.e., Column 4). Clearly, then for the case of these tliree regressors, the 

algorithun reproduajs the correlations to a satisfactory 

j 

degree/ of accuracy. 

Next we may ask: "If the coefficients in Table 1*1 are martipulatcd 
in some simple manner, might they yield a consistent set of regressor 
correlations?" The simplest change that suggests itself is to get rid 
of the negative value* To do this we shall assume that it arose from the 
variable called MTVTN and its relationship with '^the other variables. 



•By adding the value of -•0071 to the second order coefficient for 
SB, MTVTN (viz., C(XxX2)) thereby reducing it to .0583, and setting 
'the C(X2X3) value to zero, we obta^in a simplified set of coefficients* 
The application of the algorithm showed that a single consistent set 
of intercorrelations could be produced which differed very little from 
the observed values. They are compared in Table 1*3 . 



Table 1.3. Observed 



(0) Correlations Compared With Those Reproduced (R) 
From Manipulated Commonalities 

./■.■■ 



Social Background (SB) 


0 


1 


.5879 


.5219 


.5506 




R 


1 


.5556 


^'5235 


.5442 


Motivation (MTVTN) 


0 




. 1 


.2623 


.4481 




R 




1 


.2820 


.4480 


School Factors (SCH) 


0 






I 


.5674 




R 






1 


.5737 



The application of this algorithm raises two main concerns: (1) 
how to reduce the sheer volume of input and output ^ that we have to 
assimilate when there is more than one admissible solution (after 
all we don't want to have to resort to a new table comparing the 
0*8 and the R's each time); and, (2) how to manipulate the commonality 
coefficients in a systematic way. |.et us deal with each of theke 
in turn. ' \ 



io 



-9- 

\ 

To make the volume of input more comprehensible and to render 
the volume of output more manageable we shall arrange the values 
for our commonality coefficients in columnar form as follows: 

OBSERVED VALUES TRANSFORMED 



Obseirved 



U(Xi) 
U(X2) 
U(X3) 

C(XiX2) . 

CCX^Xg) 

CCXjXg) 

C(XiX2X3) 

RSQXX^X2X3) 

Number of 
Admissible 
Solutions 

\ 

The first column will contain the observed values while the 
succeeding columns will contain various t:rans forma tipns or manipulations 
of these values. There are two ways to perform these transformations • 
The way that we have selected might be called . "sequential resolution". 
By this we mean that the hi^gher order commonality values are split up 
and pushed into the next lo\^er order each time. For each such trans- 
formation the algorithm indicates the number of admissible solutions*. 
\ 

*Ihe number of admissible solute ons is affected by the tolerance limits 
one sets (see Appendix A). Generally, the more lax this, criterion is,' 
the greater are th'e number of admissible solutions. Almost without 
exception, we have used a limit of .005 or less, (see Appendix A). 
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A second way which might be called "bypass resolution", splits 
up the higher order values and relegates them directly to the uniques 
(U(Xi)'s) without passing them through the intermediate olcders • For 
large numbers of variables (five or more) these two approaches may 
not yield the same resolutions (\^iz., oncje all the values, have been 
relegated to the uniqueness, the unique values may differ for the' 
two techniques). We have chosen the former rather than the latter 
because^we have often found the results of each step in a sequence 

to be informative.* ^ x j - 

I 

/ • ' 

What kinds of theory then might we use 'as a guide to our sequential 
resolutions. The one that most readily comes to mind is a' "state of 
ignorance" theory. Such a theory says that we don't know how best to 
allocate the higher order values and therefore that we shall split 
them up so that each variable gets an equal share. In ^ble 1.4 **the 
transformations under the columns numbered 2 and 3 represent these 

resolutions (we have already examined results for the observed 

\^ 

commonalities and those for the fi^rst transformation). Our procedure 
for these was as follows: ^ 
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* We do not know which one of these techniques is optimal. 
** In order to facilitate the paper's readability the reader 
is advised to unstaple the pages and'^pull out *th(a tables 
Q that are being referred to. ^ 
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"State of Ignorance" Transformations 
— Take the simplified values from the first transformation 4^e 
shall use the simplified commonalities for these and all succeeding 
transforpations)^? 
^> 

, T2 - Split C(X3^X2X3) into 1/3 's and put 1/3 on each of 
C(X^X2), 0(5(^X3) and CCX^Xj) 

- T3 - Split each p£ the C(XiXj>;^s(as modified in T2) in half and put 1/2 
. ^ i^ each of its relevant uniques, (e.g., % C(Xj^X2) is added 

to U(Xi) .and the other half taU(X2), perform similar 
operations for CCX^Xg) with respect to U(Xj^) and UCX^); 
] and. for C(X2X3) with respect to U(X2) and UCXg)). 

We can see from inspection of Table 1.4 that T2 did not yield 
any admissible solutions (probably because of the large C(X2X3) value 
produced) whereas T3 yielded five. The correlations obtained from 
these five solutions ar^ given in Table 1.5 along with the inter- 
correlations for feh^fiifSft trar|sformacion and those obtained from the 

^ \ * I ' ^ . 
obsei^ved values. For the ^ fiijst solution, (3A), the regressor inter- 
correlations are all zero. Eadh^^^regressor correlation takes on a very high 
value in turn for the next th|ee ^solxitlons^wiTile l^ie others remain low 
whereas , for the last solution, all the values are high. ^ExScdtia^l^n^^ 
the regression coefficients given in Table 1.6 show that they vary in 
magnitude enormously cv^r the five-solutions. For example, variable 1 
(Social Background) takes on a high value of 2.03 and a low of - 11.25. 
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* We may "reqall that the regressor-regressancf correlations are fixed for 
0_ any single .resolution (or transformation), 
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Variable 2 {Motivation) ranges from 4,76 down to -1,68 while Variable 3 

/ 

(School factors) remains positive all the time but ranges from a low of 
^•43 to a high of 7.06. Clearly then for regressor correlations that 
yield solutions consistent with the fixed regiressor-fegressand relatj^n- 
ships given in Table lls (those are rl4, r24 & r34) , their regression 
coefficients may differ over an enormously, for a fixed RSQ (X^XjX^) 
qf .4417. Our "state of ignorance" theory then allows a great deal of 
variation in the admissible solutions. 

Theory Cuided Trans formattoaa. 

Let us consider next how sa few content oriented theories might help 
us resolve these values. They- are: 

T4 - All the variance that is^accounted for by SB (Social 

Background) is also caused by it. Therefore, take every 
higher order coefficient that has a one in its subscript j 
and add it*s value to the unique value for SB (viz. add 
them to UXXj^)), 



T5 - All the variance that is accounted for by MTVTN (Motivational 
factors) Is also caused by it. Therefore, take every higher 
order coefficient that has a 2 in its subscript and add its 
value to the unique value for MTVTN (viz. add them to U(X2))« 

T6 - All the variance that is accounted for by SCH (School 

factors)« is also caused by it. Therefore, take every higher 
order coefficient that has a 3 in its subscript and add its 
value to the unique value for SCH (viz. add them to U(X3)). 
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Inspection of Table 1.4. shows that these transformations have 5, 2 
and '2 admissible solutions respectively. Let us examine their correla- 
tions in Table i.5 and their regression coefficients in Table 1.6. 

Examination of Table 1.5 shows that the solutions for TA" (our-every^ 
thing associated with Social Background belongs to Social Background 
theory) vary from zero through .58 for rl2, and to about .9 for rl3 and 
r23. Clearly for these regressor-regressand relationships the correlations 
between SGH (School factors) and other factors (either SB or MTVTN) can.^ 
vary to a greater extent than can those between SB and MTVTN. For T5 
(our everything associated with MTVTN belongs to MTVTN theory) the 
correlations between SB and MTVTN and between MTVTN and SCH stay null while 
that between SB and SCH varies from a low of .47 to a high of about .9. 
Not only are these solutions fewer in number but they are also more compact 
(i.e. less variable). For T6 (our everything that is associated with SCH 
belongs to SCH theory) the two solutions are also "compact with the SCH^ 
related correlations staying null while those for SB and MTVTN range from 
about ^52 almost up to 1.0. 

In examining the behavior of the. regression/coefficients for these 
solutions, given in Table 1.6, we can note for T4 that they range from: 
a low of about: .5 to a high of about 1.5 for SB; a high of about .3 for 
MTVTN to a* low of about -.3; and, from a low of -1.3 for SCH to a high of 
about .6. Clearly then the variability is greater over this set of 
solutions for SCH than for any of the others. For T5 the variability is 
least for MTVTN and greatest for SB. For T6 the variability is null for 
SCH but very large for SB and MTVTN. What these results along with those 
In Tables 1.4 and 1.5 suggest is that the variable .or variables which have 

15 
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Table 1.4 Ttansformed Commonality Values and Their Number of ^dmissiMe 
Solutions '■ 



TRAKSFOEMATIONS 



Commonjality 
u06rrxcx6ncs 




1 


2 


■ 3 


4 


5 


6 










1412 


.2961 


.0236 


.0236 


U(X2) 


0307 


■ .0307 


.0307 


.0971 


.0307 


.2007 


.0307 




.1149 


.1149 


.1149 


.2034 


,1149 


.1149 


.3291 1 


C(XiX2) 


.0654 


.0583 


.0955 


0 


0 


0 


.0583 1 


C(XiX3) 


.1025 


.1025 


.1397 


0 


0 


.1025 


^ ! 

) 




nn7i 
— •UU/1 


0, 


.0372 


0 


0 




0 


C(XiXoXo) 
RSQ(XiX2X3) 


.1117 


.1117 


. 0 


0 


0 




O'l 

i 


.4417 


.4417 


.4416 


.4417 


.4417 . 


.4417 


.441^ 

1 


NUMBER OF 

Cf\T TTTTHXTO 

bULUiJLUr^o 


1 


1 


NONE 


■ 




2 


2/ 






TRANSFOPMATIONS 


; 






1 
1 


Conmonallty 


\ 7 


8 


9 


/ 








u \^\J 

U \A,2/ 


.2111 
.1157 


.0236 
.0307 


.0731 
.0546 


/ 

/ ' 






I 




.1149 


.1149 


.1498 


1 






J 
1 


C(XiX2) 


0 


.0769 


.0385 / 








f 

/ 


C(XiX3) 


0 


.1211 


.0606 








/ 


C(X2X3) ' 


0 


.0186 


.0093 








1 

t 


C(XiX2X3) 


0 


.0558 


.0558 








i 
i 


RSQ(XiX2X3) 


.4417 


.4416 


.4417 








t 

I , 


NUMBER OF 
SOLUTIONS 


# 

3 


NONE 


NONE 






1 

1 
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^VARIABLES J' 
1 - Social Background ' (SB) 
" 2 - Motivation (WTVTN) 
3 - School (SCH) 
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•Table 1.5 Regressor and Regressand Correlations Reproduced from 
Commonalities 



TRANSFORMATIONS 



r* 


Observed 


1 


3A 


3B 


3C 


3D 


3E 


12 


.5879 


.5556 


0 


.9827 


0 


0 


.9827 


13 


.5219 


.5235 


0 


0 


.9836 


0 


.9836 


14 


.5506 


.5442 


.3758 


.3758 


.3758 


.3758 


.3758 


23 


.2623 


.2820 


0 


0 


0 


.9353 


.9353 














3116 


.3116 




• 30/3 


• 3/3/ 




.HjJLU 






.4510 


r 


4A 


4B 


4C 


4D 


4E 


5A 


5B 


12'- 


• 0 


.5835 


0 


0 


.5835 


0 


0. 


13- 


0 


0 


.8976 


0 


.8976 


.4709 


.9031 


14 


.5442 , 


.5442 


.5442 


.5442 


.5442 


.3551 


.3551 


23 


0 


0 


0 


.•B158 


.8158 


0 






* 1^1 Jim 




IT'S? 


.1752 


.1752 


.4480 


.4480 














4663 


.4663 


r 


6A 


'6B 


7A 


7B 


7C 






12 


.5192 


.9973 


0 


.9564 


0 






13 


0 


0 


0 


0 


.9555 




\ 

\ 


14 


.2862 


.2862 


.4595 


.4595 


.4595 






23 


0 


0 


^^ 


0 


0 






24 


. .29,83 


.2983 


.3402 


.3402 


.3402 






34 


.5737 


.5737 


.3389 


.3389 


.3389 






♦VARIABLES 














1 
2 
3 


- Social' Background (SB) 

- Motivation' (MTVTN) 

- School (SCH) 




ff 

17 


1 


ff 
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Table 1*6 Regression Coefficients Computed from Reproduced Regressor- 
Regressand Correlations 

TRANSFORMATIONS 





Observed 


1 


3A 


1 36 


3C 


3D 


3E 


1 

2 
3 


.2152 
.2170. 
.3982 , 


.2146 
.2158 
.4005 


.3758 
.3116 
.4510 


2.0305 
-1.6838 
.4510 

1 

f 


-2.0818 
.3116 
^ 2.4986 


.3758 
-.8809 
1.2749 


-11.2492 
4.7600 
• 7.0633 ' 


P 


4A 


4B 


4C 


4D 


4E 


5A 


5B 


1 
2 
3 


.5442 
,1752 
.3390 


.6700 
-.2158 
.3390 


1.234 

.1752 
-.7688 


.5442 
-.3030 
.5852 


1.5198 
.3731 
-1.3295 


.1741 
.4480 
.3842 


.4480 
.7893 


P 


6A 


6B ■ 


7A 


7B 


7C 


• 




1 


.1797 


-2.082 


.4595 


1.5739 


1.5569 







2 
3 



.2050 
.5737 



2.375 
.5737 



.3402 
.3389 



-1.1652 
.3389 



.3402 
-1.1487 



*VARIABLES 

1 - Social Background (SB) 

2 - Motivation (MTVTN) 

3 - School (SCH) 



I 
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the l^nrgest unique values after resolution tend to show less variability 
in their regression coefficients than those with lower unique valties— a 
not surprising result. 

I ■ . , ^ ■ 

Mixed Transformations 

It may also be meaningful to perform mixed transforations - that 
is - transformations that are guided by a theory but at ^ints where the 
theory does not provide guidance a random rationale can be\dopted. Some 
of these are: ^ 

T7 r The SCH factors should have only the value of their 
; uniqueness (UCXj)) but we do not know how the values' 
I for the higher order coefficients should be spread. 

among xSB & MTVTN. Therefore, split CCXj^X^X^) into 1/3 
and add this value to each of the second order coefficients 
(viz. C(X^Xj)). Then add toU(X]^) half of the new CiX^X^) 
and all of the new C(X^X3) . Similarly add to U(X2) half 
of the new 'C(X2^X2) ^nd. all of the new C(X2X3) . ^ , 



T8 - This approach postula^tes that only half of the variance we 
observe in the thitd order can be resolved into the second 
orders, whereas the rest should remain where it is. There- 
fore, take half of C(XiX2X3), split this value into l/3*s 
* ^i.e. a value of 1/6 of C(X'jX2^) and add this much to each 

of the C(XiXj)*s. 

T9 - This approach takes one half of the newly created second 
order values from T8, splits them in one-hail (i.e. 1/4 
of C(X-^2^y 1/4 of C(XiX3)etc.) and adds. each of these to 
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Its appropriate uniqueness. That is, UCX^) gets 1/4 
of C(XiX2) and 1/4 of CiX^X^); U(X2) gets 1/4 of 
C(XiX3) and 1/4 of C(X2X3), etc. 

Inspection of Table 1.4 shows that T7 yielded three admissible solu- 
tions whereas T8 and T9 did not yield any. Tliese results along with those 
for T2 suggest that theories about only part of the shared variance are not 



as likely to yield -admissible solutions. 

i 

Let us examine the nature of the solutions to T7 in Tables 1.5 and 1.6. 
The correlations in Table 1.5 show that the values for SB and MTVTN and 
SB and^SCti range from zero to about *95 but never simutaneously, while 
the c/rrelatlon between MTVTN and SCH stays at zero. The regression 
coefficients for the T7 solutions, in Table 1.6, vary from a low of about .45 
to/a high of about 1.6 for SB,whereas those for MTVTN vary downward from 



-1 " ^ 

.34 to -1.16 and those for SCH vary from about .34 down to -1.15. However, 

jfhe coefficients for MtVtN take on a slightly greater range than that of 

/ 

/ SCH while both of those take on a greater range than those of SB. 

/ 

In this section we have examined a number of different ways that might 
/ be used to resolve higher order commonality values into their lower orders 
for a fixed level of variance explained (viz. /a fixed value of the squared 
multiple correlation). In the. next section we examine results obtained from 

i / 

varying the level of explained variance. 

2. Three Variable Reverse Commonality for Varying Levels of Explained 
Variation 

■> • 
The algorithm's results may show that for a fixed level of variance 

explained (i.e. in our case a fixed value ofRSQ (X1X2X3)) an analysts' 
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.particular resolution does not yield any admissible solutions. As a 
consequence, the analyst may want to explore neighboring levels to see 
if they yield solutions. In this section we examine results in which the 
level of explained variance, is systematically varied. 

These computations were performed by taking the simplified"- commonalities 
(called Tl in Table 1.4), dividing them by their RSQ (Xj^X2X3) value of .4417 
so that they sum to 1.00, and then scaling these values by multiplying them ' 
by:. .1; .2; .3; .4; .5; .6; .7; .8; .9; 1.0 respectively; these values are • 
■given in Table 2.1. We can oliserve that each of these yielded one and only 
one admissible solution.. Examination of the regressor iutercorrelations showed 
that they stayed the same for each scaling condition (these values are given 
in Table* 1.5 under column Tl) . What did change were the regressor-regressand 
correlations for the different scaling conditions. These values, given in 
Table 2.2, sTiow a progressive increase as the scaling factor approached 
1.0.' The regression coefficients, given in Table 2.3, also increased 
progressively in magnitude as the scaling, factor increased. Clearly then 
for the commonality proportions fixed on here, a wide range of RSQ (X1X2X3) 's 
will yield admissible solutions. 
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Table 2.1 Fixed Commonality Proportions and Their Number of Admissible 
Solutions for Increasing Levels of Variance Explained 



TRANSFORMATIONS 



COMMONALITY 
COEFFICIENTS* 


Sr-IPLIFIED 


.1 


.2 


.3 


.4 


.5 


.6 


U(Xi) 


.0236 


.0053 


.0107 


.0160 


.0214 


.0267 


.0321 


U(X2) 


.0307 


.0069 


.0139 


.0208 


.0278 


.0348. .0417 

\ 


U(X3) 


.1149 


.0260 


.0520 


.0780 


.1040 


.1301 


\i561 


C(XiX2) 


.0583 


.0132 


.0264 


.0396 


.0528 


,0660 


. .0792 


C(XiX3) 


.1025 


.0232 


.0464 


.0696 


.0928 


.1161 

1 


.1393 


C(X2X3) 


0 


0 


0 


0 


0 


Oi 


0 


C(X 1X2X3) 


.1117 


.0253 


.0506 


.0759 


.1011 


.1265 


.1517 


RSQCXiXoXq.) 


.4417 


.0999 


.2000 


;2999 


.3999'- 


.5000 


.6001 


NUMBER OF 
SOLUTIONS 


X 




J. 


1 
X 


• 

1 
X 




1 ■ 


COMMONALITY 
COEFFICIENTS* 


.7 


.8 


.9 


1.0 








U(Xi) 


.0374 


.0427 


.0481 


.0534 


I 




f 


U(X2) 


.0486 


.0556 


.0626 


.0695 






1 


U(X3) 


.1821 


.2081 


.2341 


.2601 








C(XiX2) 


.0924 


.1056 


.1188 


.1320 








C(XiX2) 


. .1625 


.1857 


. .2089 


.2321 








C(X2X3) ■ 


0 


0 


0 


0 








C(XiX2X3) 


.1770 


.2023 


...2276 


.2529 








RSQ(XiX2X3) ' 


.7000 


.8000 


.9001 


1.0000 








NUMBER OF 
SOLUTIONS 


1 


1 


1 


3. 









♦VARIABLES 



ERIC 



1 - Social Background (SB) 

2 - Motivation (MTVTN) 

3 - School (SCH) 
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Table 2.2 Regress or -Regress and Correlations Reproduced from Fixed 

Commonality Proportions and Varying, Levels of Variance Explained 



TRANSFORMATIONS 



r* 


OBSERVED 
SIMPLIFIED 


.1 


.2 


.3 


.4 


.5 


.6 


.7 


14 


.54:^2 
.44^ 


.2588 


.3662 


.4484 


.5178 


..5791 


• .6343 


.6851 


24 


,2131 


^ .3015 


.3692 


.4263 


.4768 


.5221 


.5639 


34 


^5737 


.2729 


.3860 


.4728 


.5458 


.6105 

• 


.6687 


.7222 

■ 


r* 


.8 


.9 


1.0 








J 




14 


.7323 


.7768 


.8188 












24 


.6029 


.6395 


.6741 








•i 




34 


Jill 


.8189 


.8632 













i 



*VARIABLES 

1 - Social Background (SB) 

2 - Motivation (MTVTN) 

3 - School (SCH) 
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Hkble 2.3 Regression Coefficients Computed from Admissible Solutions for 
Fixed Commonality Proportions and Varying Levels of Variance 
Explained • 



TRANSFORMATIONS 

.7 





, OBSERVE)!) 
SIMPLIFIED 


.1 


.2 


.3 


.4 


.5 


.6 


1 


.2146 


.1019 


.1445 


.1768 


.2043 


.2283 


.2502 


2 


.2158 


.1025 


.1452 


.1778 


.2054 


.2298 


.2515 


3 


.4005 

■ 


.1906 


.2694 


.3300 


.3810 

■ 


.4262 


.4668 

- 


P* 


.8 


.9 


1.0 










1 


.2887 


.3063 


.3229 






•. 




2 


.2905 


.3082 


.3248 










3 


.5390 


.5716 


.6026 











*VARIABLES 



1 - Social Background (SB) 

2 - Motivation (MTVTN) / 

3 - School (SCH) I 
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We also took the same set of commonality" proportions as are given 
in column 1.0 of Table 2.3, relegated them to the uniquenesses according to 
our theory called T7* in the previous section, and then scaled these 
proportions by increments of .1, ranging from .1 to 1.0. Each one of 
!these yielded 3 admissible solutions which, upon examination, were found to 
have the same regressor values across the different RSQ(XiX2X3) *s. One 
solution was the identity matrix, another had^a .96 in the r 12, location 
and zero's elsewhere while the third solution had a .96 in the r 13 position 
and zero's elsewhere. 

From these two sets of results it would appear that if a set of 
solutions, can be found for fixed commonality proportions and one value of 
an RSQ(XiX2X3), then they can also be found for neighboring and even 
^quite distant RSQ(X]^X2X3) values. There is nothing astounding ab^ut these 
results for they follow directly from the algebra of the algor^^m as set 



ERIC 



forth in Appendix A. 'However, these latter statements are so only for 
commonality proporti^on^ that are all positive. It seems obvious that if 
some of the proportions are negative while others are positive, and, if ^ 
both increase in their absolute magnitudes, then a point (or value of 
RSQ(Xj^X2X3)) will be reached beyond which admissible solutions cannot be 
obtained. 

3. Three Variable Reverse Commonality for Varying Levels of Unique Variation 

In the preceding sections we examined results when: the level of explained 

variance was held constant and the relative proportions of the variables 

were manipulated; and,when the relative proportions were kept fixed and the 

level of explained variance was systematically varied. An analyst may also 

# 

, *This is the theory that SCH factors should^ifl^ve only their unique value 
but, that we don't know how to split -up the higher order commonalities 
and, therefore, a random rationale is adopted* 

25 



want tp examine how large or small a unique value a variable may ^ake on 
and still yield an admissible solution* For example, in the equation 

(12) RSQ(Xi) - C(Xj^X2X^) + C(XiX2) + CCXiXa) + U(Xi) 
the empirical value for UCX^) can be made to take on progressively larger 
or smafler values. Such a manipu^/^tion will alter not only the RSQ for 
that variable but also the RSQ for all three variables combined (as can be 
seen from equation 11 in the introductory section) . In a sense this kind 
of analysis tests how large or hoyr, small a relationship one variable might 
have with a given dependent variable and still be consistently related^, to 
other regressor variables. For this example we have chosen to reduae the 
unique value of one variable to see to what extent this reduction affects 
the dverall results. 

In Table 3 we have systematically reduced the size of the unique 
value for variable X3, which represents the set of School factors (SCH)^ 
The first set of columns in the uppermost part of Table 3 gives the set of 
transformations, as follows: 

Tl - this is a simplification of the observed commonalities 
and is included for purposes of -comparison with the 
suceeding transformations. In previous sections it has 
been called Tl. 

T20 - the unique value for X3 in Tl is reduced by .05 while 
all other values are kept constant. 

T21 - the unique value for X^ in Tl is reduced by .10 while 
all other values are kept constant. 
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T22 - the unique value for in Tl is set equal to zero 
while all other values are kept constant* 

Foir each of these transformations the algorithm . indicated that there 
was only one admissible solution. In the center of the table we can observe 
the trend in these solutions. The correlations between variables: 1 and 2, 
1 and 4; and, ^ and 4 remain the sam^ across the diffeieac wCansformations 

^nd this is to be expected since the coefficientji for these variables were 
not manipulated. The correlation between variables 1 and 3 increased 
substantially and this increase can be attributed to the fixed second 
order coefficient ,(C(XiX3)) and the fixed third order coefficient (C(X2X2X3)). 
In a sense, as the unique variability of a variable is reduced but its common 

' vafiability keptrfixed, its iCorrela[t±on»with the other variables must of 
necessity, increase. TIjis trend also occurs for the correlation between 
variables 2 and 3, but to a lesser extent. The correlation between variables 
3 and 4 decline progressively, which is to be expected since this is the 
relationship we manipulated when we reduced U(X3). . 

It is perhaps of even greater interest to examine the trend in the 
regression coefficients for these solutions. The coefficient for variable 
1 shows a progressive increase, that for variable 2 stays about the same 
while that for variable 3 declines appreciably but does not become zero. 
Clearly then School factors derives a portion of its explanatory power 
from its relationship with the other regressor variables.* 

* We also conducted a series of analyses in which the second order coefficient 
for variables 2 and 3 (viz. C(X2X3)) was made increasingly negative, while 
the other values were kept constant. These analyses did not yield any ^ 
admissible solutions. However, when the negative ♦coefficient for C(X2X3) , 
was kept at its observed value (see Table l.'l) and the coefficients were , 
unitized and then scaled by increments of -1, from . 1 to 1.0, a single 

admissible solution was obtained for each such transformation." 
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Table 3 Reduced Unique Values and their Effects 



TRANSFOmTIONS 



'^OMMOKALITY 
COEFFICIENTS* 


OBSERVED 
SIMPLIFIED Ki') 


ZO 


21 


99 




.0236 


• 0236 


•0236 


• UZJO 


U(X2) 


♦0307 


•OjO/ 


• UJU/ 


• IIJV# 


U(X3) 


.11A9 


•06A9 


•01A9 


0 


C(XiX2) 


.0583 


•0583 


•0583 


♦0583 


C(XiX3) 


♦1025 


• 1025 


• 1025 


•1025 


C(X2X3) 


d 


0 


0 


0 


C(XiX2X3) 


•1117 


•1117 


;1U7 


♦1117 • . 


tSQ(XiX2X'3) 


•4A17 


• 3917 


• 3A17 


♦ 3268 


NUMBER OF 
SOLUTIONS ' • 


1 


1 


1 


1 


REGRESSOR-REGRESSAND CORRELATIONS 








OBSERVED 
SIMPLIFIED (1) 


TRANSFORMATIONS 
20 21 


22 



12 
13 
lA 
23 
2A 
34 



B* 



♦5556 
♦5235 " 
♦5AA2, 

♦ 2820 
♦4480 

♦ 5737 

REGRESSION COEFFICIENTS 



OBSERVED 
SIMPLIFIED (I) 



♦ 5556 

♦ 59AA 

♦ 5AA2 

♦ 311A 

♦ 4A80 

♦ 5283 



♦ 5556 

♦ 7252 

♦ 5AA2 

♦ 3528 
♦AA80 
♦A786 



TRANSFORMATIONS 

20 21 



♦ 5556 

♦ 8505 

♦ 5AA2 
♦3688 
♦A^80 
♦Al 28 



22 



♦21A6 
.2158 

.A005 



♦ 2301 

♦ 2195 

♦ 3232 



♦ 2732 
♦2253 

♦2011 



♦ 3A89 

♦ 2233 

♦0837 



★VARIABLES 



1 - Social Background (SB) 

2 - Motivation (MTVTN) 

3 - School (SCH) 
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In the next section we examine results of the algorithm for 
larger numbers' of variables. 

4# Seven Variable Reverse Commonality 

As the number of regressor variables increase so too do the 
potential niimber of admissible solutions and their corresponding 
•vo3'ui?e of output. For example, in an analysis with 4 regressor 
variables one resolution yielded 16 admissible solutions for 

5 regressors one resolution yielded 52 admissible solutions; and, for 

6 regrqssprs a single resolution yielded 193 admissible solutions. 
Headless to" say, we have no desire to report on that many results. 

4 

However, a certain number of difficulf:ies do arise when working with 
large number of regressors and since the number of admissible 
solutions were fairly low for our 7 regressor case we have chosen to 
deal with some of these results in this section (seven regressor 
variables happens also to be the maximum number that the algorithm 
can currently accomodate). 
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The variables used- In these analyses are composites or indices as 
developed in Mayeske, et.al., 1972. They are: ^ 
1* Socio-Economic Status / 



i 



2. Family Structure and Stability 

3. Ethnic Group Membership ' 
. '4. Expectations for Excellence 

5. Attitude Toward Life - / , , 

6. .Study Habits / 

7. School Factors (the student body's Achievement Level) 

8. «^Iridivtdudl ^Student ^Achievement 

• ■ ' I / 

The first 7 variables are us^d as regressors while the eighth is the 

regress and (or dependent variable) . J 

\ 1 
First we should recall that with 7 variables one obtains 127 different 

commonality coefficients (2^-1 where m Is the number of variables) . This 

is usually too large a number to work with even if the volume of output were 

small. However, we decided to submit our observed results of the 7 variable 

" /i / 

commonality analysis to the Reverse Commonality algorithm to* ^ee if it wou^d 
yield a single admissible solution. Actually, it yielded 3/ admissible 
solutions all very similar to one another and to the observed correlations , 
save for a few variations in the magnitude of the correlation between , 
variables 4 and 5 and, between variables 4 and 6, as can^ be seen from Table. 
4.1. The regression coefficients obtained from 

30 



-29- 

T«ble 4.1 Regreaaor and Rcgressand *Cor relations Reproduced from Seven 
Variable Coomonalltles 



' REPRODUCED FROM REPRODUCED FROM SIMPLIFIED 

OBSERVED COMMONALITIES COMMONALITIES 



r* 


OBSERVED 


1 


2 


3 


1 


2 


3 


4 


12 


.3567 


.3574 


.3574 


.3574 


.3703 


.3703 


.3703 


.3703 


13 


.3714 


.3719 


.3719 


.3719 


.3748 


.3748 " 


.3748 


.3748 


14 


.2787 


.2793 


.2793 


.2793 


.2740 


.2740 


.2740 


.2740 


15 


.3736 


.3739 


.3739 


.3739 


.3795 


.3795 


.3795 


.3795 


16 


.3968 


;3972 


.3972 


.3972 


.4120 


.4120 


.4120 


.4120 


17 


.4346 


.4345 


.4345 


.4345 


.4472 


.4472 


.4472 


.4472 


18 


.4973 


.4974 


.4974 


.4974 


.4850 


.4850 


.4850 


.4850 


23 


.2999 


.3007 


.3007 


.3007 


.3019 


.3019 


.3019 


.3019 


24 


.3725 ' 


- .3734 


.3734 


.3734 


.3839 


.383^ 


.3839 


.3839 


25 


- .4707 , 


.4707 


.4707 * 


.4707 


.4854 


.4854* 


.4854 


.4854 


26 


.4723 


.4715 


.4715 


.4715 


.4967 


.4967 


.4967 


.4967 


27 


;2908 


.2894 


.2894 


.2894 


.3032 


.3032 


.3032 


:3032 


28 


.3259 


.3260 


.3260 


.3260 


.3028 


.3028 


.3028 


.3028 


34 


.1684 


.1678 


.1678 


.1678 


.1550 


.1550 


.1550 


.1550 


35 


.2253 


.2253 


.2253 


.2253 


.2146 


„ .2146 


.2146 


.2146 


36 


.2219 


.2217 


.2217 


.2217 


.2101 


.2101 


.2101 


.2101 


37 


.6165 


".6173 


.6173 


.6173 


.6420 


.6420 


.6420 


.6420 


38 


.4908 


.4909 


.4909 


.4909 


.4784 


.4784 


.4784 


' .4784 


45 


.5172 


.5143 


.7782 


.5143 


.5699 


.7148 


.5699 


.7148 


46 


.4894 


.4876 


.4876 


.8264 


.5451 


.5451 


.7639 


.7639 


47 


.1558 


.1553 


.1553 


.1553 


.1562 


.1562 


.1562 


.1562 


48 


.2552 


.2546 


.2546 


.2546 


.2356 


.2356 


.2356 


.2356 


56 


.6623^" 


.6603 


.6603 


.6603 


.7112 


.7112 


.7112 


.7112 


57 


• 2321 


.2312 


.2312 


.2312 


.2306 


.2306 


.2306 


.2306 


58 


.3820 


.3818 


".?818 


.3818 


.3635 


.3635 


.3635 


.3635 


67 


.2331 


.2327 


.2327 


.2327 


.2344. 


.2344 


.2344 


.2344 


68 


.3674 


.3670 


.3670 


.3670 


.3523 


.3523 


.3523 


.3523 


78 


.5674 


.5673 


.5673, 


.5673 


.5598 


.5598 


.5598 


.5598 



* Tlte varlablea are: 1-Soclo-Economlc Status; 2-Farally Structure and 
Stability; 3-Ethnlc Group Membershln; 4-Expectatlons for Excellence; 
5-Attltude Toward Life; 6-6tudy Hablta; 7-School Factors; 8-Indlvldual 
Student Achievement 



Table 4.2 Standardized Regression Weights from Observed and Reproduced 
Commonalities for Seven Variables 



REPRODUCED FROM OBSERVED REPRODUCED FROM SIMPLIFIED 
CaiMONALITIES COMMOIJALITIES 



/?* 


OBSERVED 


1 


2 


3 


1 


2 


3 


4 


1 


.2116 


.2116 


.2120 


.2091 


.2084 


.2079 


.2057 


.1998 


2 


.0050 


.0056 


.0086 


.0068 


-.0203 


-.0189 


-.0205 


-.0256 


3 


.1626 


.1622 


.1632 


.1636 


.1527 


.1529 


.1535 


.1542 


4 


.0104 


.0103 


-.0738 


-.0623 


—.0071 






1 1 An 


5 


.1328 


.1332 


.1957 


.1335 


.1377 


.1677 


.1382 


.1777 


6 


.0766 


,0762 


.0747 


.1333 


.0737 


.0753 


.1082 


.1350 


7 


.3235 


.3235 


.3208 


.3213 


.3268 


.3258 


.3259 


.3244 


RSQ 


.4517 


.4516 


.4537 


.4528 


.4253 


.4264 


.4262 


.4298 



* The variables are: l-Socio-Economic Status; 2-Family Structure and 
^•Stiibility; 3-Ethnic Group Mmber ship; -4- Expectations for Excellence; 
5-Attitude Toward Life; 6-Study Habits; 7-School Factors; 8-Individual 
Student Achievement 
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• these solutions, given in Table -4.2, show that they tend to be very similar , 
. to 'those obtained from the observed correlations save for variable 4 

(Expectations for Excellence) which takes on small negative values and 
- variables 5 (Attitude Toward Life) and 6 (Study Habits) which take on 
slightly larger values for the second and third solutions, respectively. 

Next, in order to reduce the sheer number of coefficients, we set 
equal to zero any empirical value that had zero's for its first three 
decimal digits.* The algorithm indicated that there were four admissible 
solutions for this simplified set of. coefficients. The correlations and 
regression coefficients obtained from these solutions are given in the 
last four columns of Tables 4.1 and 4.2 respectively. Inspection of 
these values shoys them to be virtually identical to one another except 
for the correlations between variables 4 and 5 and variables 4 and 6. 
These latter alternate in taking on higher or lower values. Although 
these correlations are not very different from one another they do differ 
enough from the observed correlations to result in somewhat different 
values for some of the regression coefficients, as can be seen from Table 4.2. 
This is especially so for variables 2 (Family Structure) and, 6 (Study 
Habits). It appears to be these attitudinal and motivational variables 
then that show the greatest sensitivity to simplification of the commonalities. 

In an attempt to further simplify the number of empirical values one had 
to work with we set equal to zero any of these commonalities that had zero's 
in the first two decimal digits.** For thes6 simplifications the algorithm 
indicated that there were not any admissible solutions. + We then took these 

*e.g. .0004 would be set to zero but. 0040 would not. 

**e.g. .0040 would be set to zero but .04000 would not. 

+However, when the criterion was relaxed to a value of 1.0, the algorithm 
indicated that there were 20 admissible solutions (see Appendix A). Whether^ 
or not it is desirable to relax this criterion will depend upon the analysts 
gj^Q objectives. «j3 
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further simplified values, split them up and allocated them to the unique 
Values in two distinctly different ways* For each of these the algorithm 
indicated that there were 15 admissible solutions* We shall not. dwell on 
the nature of these results other than to note that the regressor correla- 
tions often varied over a considerable range. Some of them assumed values 
in the range from zero to as high as .8 or *9. 
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5. Summary 

In this paper we have examinesd the results of an algorithm which 
is designed to take a set of commonality coefficients., either real or 
manipulated, and, if possible, produce one or more sets of regressor 
correlations that are consistent with them, A number of different ways 
of resolving the higher order commonality values into their lower orders 
were tried and the number and nature of solutions generated from them 
were examined for their meaningfulness and variability. In general it 
was found that this could be a meaningful exercise because it allows an 
analyst to test his assumptions about the nature of the' confounding to 
see if a solution can be obtained. If one cannot be obtained then his 
assumptions must be revised. But if one can be obtained, then he can examine 
them to see which kinds of variables have the greatest sensitivity to the 
assumptions* The resulting output also allows one to gauge the variability 
of regression coefficients that will satisfy the same set of commonality, 
values and the effectiveness of .the regression system. 

This technique called "Reverse Commonality" is best suited for an 
interactive computing arrangement so that an analyst can use it rapidly 
In a sequential manner. Since, for a large number of variables the sheer 
volume of commonality values becomes unmanageable and, since a number of 
different ways are available for resolving the confounding, an algorithm 

Is needed which uniquely ^resolves the higher order values according to 

» - / 
one's assumptions. 

/ 
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Reverse Commonality (REVCOM) 



1, Discussion 

Let us assume that there Is a variable y which has been used 
In a regression analysis In which variables (j«l,2, . . . » have 
been used as predictors. Let us further assume that the statistics 
available to us from the regression analysis Is ^either the common- 
alities o;r squared multiple correlations of y wlth^each possible 
combination of predictors. The signs of the correlations of indi- 
vidual X. with Y are known* Our prob'fem is to compute the Inter- 
correlations among the predictors from the known simple and multiple 
correlationa . 

Thia problem must have at least one valid solution if the 
known coefficients were computed from reil data. There need not be 
a unique solution, however, and in the* two predictor case there 
will ordinarily be tw^ solutions. Early experience with test 
problems Indicates that ther^ are not very many solutions for 
multl-predlctor systems. J/ artificial commonalities or squared ' 
multiple correlations are supplied" then there may be no solutions 
Indicating that the. correlations are not Consistent with any 
possible set of real data. 

This paper presents an algorithm for computing all possible 
Interpredictor correlation matrices from a given vector of common- 
alities and the signs of the simple correlations between y and the 
x^. Commonalities can be simply converted to squared simple and 
multiple co-r relations. We will use only the more familiar multiple 
correlation coefficients for discussion here. 

■37 
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Discussion 



The information, is & vector of all possible squared tnultipl* 
correlations of a dependent vairiabla y with the independent 
variables Xj (j«l ,2, . . .m) which is created froa the commonalities. 

There are 2"-l such correlations which will be indicated by subscripts 

.2 X.2 «2 «2 -2 

V •y2'---V' V2» V23---Sl23...ii- 

The signs of the simple correlations r^j are also known. 

We wish to compute the simple correlations r (i»l,2, . .m-1; j-i+1, 
i4^2,....m) among the dependent variables. There are m(m-l)/2 such 
correlations. ^ . * . 

The general strategy is to compute the possible values of r 
th'«/t' are / consd^^^nt with* rj^, r^j , and R^^j-, then to check these Tor 
consistency with each other known correlation, then each^ other pair 
of correlations, and so- forth unffil they are demonstrated to be 
consistent with all other variables collectively ♦ If a possible r.. 
does not pass every such- test It is eliminated from further con- 
sideration. ' • 

■The first step is to compute the initial set of possible values 
for the squared. multiple correlation for predicting y from x* and Xj ' 
may be written 
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which may be solved for r^^ by 
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Depending on the value of the value Inside the radical^ r.. may 
have either no solution, one solution or two solutions. I£*th«'. 

value of the radlcand Is negative there is no possible r that 

'2 

could combine vlth r^^ and r^j to create «i>ch a ^yji^j thus the 

given values could not come from real data. In 'the rare case 
that the radlcand is zero, r.. is unique* If the radlcand is 
positive then there are exactly two solutions which ve will refer t 
as r..-^ where S . . ■ + or - dep.ending -on the sign of the radical* 

11 

By computing r^^"* for all i and J we have computed all r^j ' 

that are possible in two variable prediction*. Since there are up 

•to 2 possible values for each x.. there are 2^^^"^^^^ possible 

combinations, of. r^^^ thus; that same number of« Intercorrelatlon^ 

matrices which must be considered* * Many of these may be eliminated 

tlirough evaluation of higher order relationships* To investigate 

higher otder relationships we have two checks: / 

^ ' (1) Are the intercorrelatlons internally consistent? and 

' . (2) Does this combination of intercorrelatlons produce the 
. corr.ect higher fordcr multiple correlation? 

V ' ' 

These two questions can be answered by standard multiple re- 
gression techniques* The SWP operator (Beaton (1964))* is a simple 
method of implementing these computations* Basically, the procej^s 
begins with 3 predictor regressions in which case there are three 
off-diagonal co-relations and up to 8 possible' combinations of 

intercorrelatlons* Each is examined in order by forming a 4x4 

* 

correlation .matrix (including the 3 Independent variables and y) 

and sweeping out the Independent variables* If the SWF operator 
h 
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encounters a negative pivot then this combination Is Internally 
Inconsistent* If the multiple correlation of ttiese .and y Is- 
not the knoHn value within a tolerance limit, theja this combination 
'does not fit the problem and Is rejected. After computing the 
(up to) 8 combinations only the r^^"^ that are In at least one 
acceptable combination are retained. Our e.xperience indicates' 



^that m^ny of the potential correlations are 



lost at this point. If 



no combiivation is acceptable then there can* be no solution to these 
problems and thus the given multiple correlations cannot come fron 
'real data. 

After trying all 3 variable combinations, all 4 variable, 5 
variable, and higher order combinations are tried until finally the 
m variable regressiort* is pe.rformed. At this point the problem is 
solved since the remaining matrices have i3een demonstrated to have 
a positive determinant and fit all given, multiple correlations. 

If the ^algorithm worked through all possib'le combinations the 
computer costs might be prohibitive for large m, The algorithm 
saved considerable time by app^achlng the problem by successive 
elimination. The 3 variable phase usually reduces the potential ^ 
^Ij substantially 'so that there are usiKalljr qot too many 4, * 
variablfe combinations worth trying; and. the 4 variable combination 
docs the same for tjie 5, and so forth. \ ■ ^ 



*Beaton,' A. .E. The .Use of 'Special Matrix Operators In 
Statistical Calculus* Research Bulletin 64-51, 
Educational Testing Service* October, 1964. 



REVCOM PROGRAM FLOW ^ ' ' ^ 

(1) Read in prbblco control card (if blank card to to <8)). 

NX » I of independent varialbes. 

NV ^ NI+1 (program adds on the depertdent variable)* 

(2) Read in signs to be used for the independent vs dependent 
correlations* 

(3) Call subroutine »CRSQ'.» 

(a) Read in inputted 'commonality table. 
, (b) Compute and print all possible R squares (2^^-l) • 
* (c) Return -to' main program. 

(4) Develop the two-va^jued NVxNV correlation matrix using the 
R square table. Report any dis ctcpancies . 

(5) If any correlation cannot be "developed go to (1). 

(6) Call subroutine »ELIM'.. ' - 

(a) Set NVB -3* 

' tb'X^ iSelect-one -corabination of NI variables taken NVB 

at a time. If all combinations computed go to (f ) ♦ 

(c) For these selected NVB variables set up all possible 
correlation matrices from the two-valued matrix 
computed in step (4) • .Compute the R square for 
each of these matrices. 

(d) If one of the two Values for any of the NVBx(NVB+l)'/2 ' 
possible correlations never gave an R square solution/ 
th*t checked against the table .computed in step (3.b) , 
eliminate it* If both values are eliminated stop the 
program* 

(e) If NVB-NI print each matrix that checks against the 
-R square table along with its inverse and label it 
a 'GOOD* case. 

(f) Go to (b)* 

Cg) Print out the two valued correlation matrix showing 
which of the NIx(NI+l)/2 x 2 values have been 
elimated by the NVB \rder elimination. 

f • 

<• • • 

41' - ■ 
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(h) NVB«NVB+1 if NVB<NI go to (b) . 

(i) Return to main progr«a« 

(7) .Go to (1) . . ' 

(8) ENdJ- 
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3. REVCOM Program Instructions 



Purpose - Given a commonality table derive the original 
> correlation matrix. ' ^ 



Input 



1) Control Card 
Cdl8« 



. Var . 
N 



6-60 



Title 



Explanation 
£ 7 Number of Independent 
^.^junrlables In the commonality 
tVble. 

;Any heading Information. 



.2) 'Sign-control card and K square tolerance 



Cols. 



3 
H 



.Var. 



Explanation 
or (Thla Is' the sign 

of the correlation between the 
flrat Independent var. and* the 
dependent var. 

or (sign of the 

cor^relatlon between the second 
independent var. and the de- ' 
pendent var. 
••f»/or (3rd sign) 

»t*t or (Ath sign) 

•+♦, or (Nth sign) 
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/ 


/ 

/ 


i 

( 
i 

/ ; ■ ■ 


1 
f 




/ Cols. / Var. 


Explanation. 


i 




[opf/ional] 11 


-20 • . TOL 


Thia i» the largest absolute 










deviation i allowed when checking 

1 

the derived R-squares against 

the R-squ;are computed from the 

( 

^commonality table. If columns 




i 




» 


• 


ll«-20 art left blank the pro- 








• * f 
/ 




gram will "use a default value 
of .OOsJ 








3) Commonality Table* 


A 








.The commonality table consiats of 2^-1 input cfiTds. There 








la one card input for each 


uniqueness and each commonality 








combination. The card contains the commonality variable 






/ 

/ 


numbers identifying the combination as ^ell as the 






commonality value. The first order commonmition are rea^ 








In first, followed by the 


possible second order , t^hird , etc. 








EX. 


Four Variable 


Problem -(2^-1 - 15) 








, • 1 


-«xx 










2 


XXX 










3 


XXX 










4 


XXX 










12 XXX 










% 13 XXX 










14 • XXX 










23 / XXX 










* 24 / XXX 










34 / - XXX 










' • 123 / XXX 










124 / . XXX 










134 / . XXX 








• 


234 / XXX 










1234 \ XXX 


• • 
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^SISjl V^r* Explanation 

" Conmonallty Level 

• 11-20 Conmonallty Value (F10.4) 

If another problem is to be run repeat steps 1-3, If this 
1» the last problem insert a blank card after the input to bring 
thexproblem to a normal termination. 

;'K . • ■ 

\ 

V -7, 
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